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Introduction 


e |am James Hare 
e Wikidata contributor since 2014 


e Consultant in information management and 
product development more generally, with a focus 
on open graphs 


Started experimenting with standalone Wikibases In 
2015, back when you had to change hardcoded 
Wikidata-specific values 


What is Wikibase? = 


e Suite of MediaWiki extensions and sears 
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e Wikibase can describe "the real ma mae: 


reference URL https://api.crossref.org/wo 


world" in its complexity without no noroner tis 


8674%2800%2981683-9 


boxing you in to a strict schema ae ee 


based on heuristic interred from DOI 
database lookup 


+ add reference 


Wait, semantic graphs in MediaWiki? 


Where have | heard this before? 


* Itsounds like Semantic MediaWiki, (| wasn't in the room when the decision was 
but it's not quite that made to build something custom instead of 
ee using an existing solution so | can't really tell 

e Semantic MediaWiki is more general you about that, sorry) 


purpose, less opinionated 


e It augments regular wiki pages 
with structured content director / manager Brian Epstein value (in this case, another item) 
property start time 1960 
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Wikibase provides a baseline 


ere ; ae end time 1967 
Wikibase Ontology with entities 
(comprising Items and Properties) ~1reference reference 
connected through Statements. reference URL http:/www.brianepstein.com/ 


e The page structure, editing 
interface are provided for you 


So what's the hype? The Beatles (01209) 


e Wikidata uses it, and Wikidata is COOL 
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e Global identity broker linking to other 


» 0 references 


resources 
e Underlying data representation is human seal as 
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presentation in multiple languages 
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Wikidata as app backend 


e Use Wikidata as a common data 
backend 


e Use SPARQL and other open 
APIs to develop applications 


e Instead of siloing your app data 
in your database, you contribute 
back to the commons 


wikxhibit.org/ 


The topic in context 


infectious diseases acute respiratory dstross syndrom 


monoclonal antibody © symptoms and signs. 


WaPriet COMR.IG health specialty 


7 all 


antiviral ¢ 
sl ‘on focus list of Wikimedia project, 


instance of 
possible treatment, ~ 


os 


artificial respiration 


ee 


" Middle East respiratory syndrome: 
polyclonal antibodies, Glass of disease 


disease transmission process, 


Contact transmission, 


Re 


giarhea 


X 


fever 


SD Korea respiratory syndrome coronavirus outbreak. 
ry 5) exe eam 


dyspnea 
2018 Middle East respiratory syndrome outbreak 
2012 Middle East respiratory syndrome coronavirus outbreak 
Significant event — 


Middle East respiratory syndrome coronavirus, 


§ 
A 


‘acute viral respiratory tract infection 


Subclass of 


ifforont from ptopie’s main category. 


Ras natural reservoir nial 


Ill] Wikidata Query Service 


Recently published works on the topic 
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Lethal zoonotic coronavirus infections of humans — comparative 
phylogenetics, epidemiology, transmission, and clinical features of 
coronavirus disease 2019, The Middle East respiratory syndrome and 
severe acute respiratory syndrome 


The Middle East respiratory syndrome coronavirus (MERS-CoV) nucleic 
acids detected in the saliva and conjunctiva of some naturally infected 
dromedary camels in Saudi Arabia -2019 


particular disease. 
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severe acute respiratory syndrome // Middle East respiratory 
syndrome // ecological epidemiology 


Saudi Arabia // Middle East respiratory syndrome 


Wikidata? Wikibase? 


Wikidata is the largest and most notable Wikibase, but it is not the only one, 
nor should it be 


Wikidata community is overwhelmed with the large influx of semi- 
automated content additions 


Proper scope of Wikidata is an outstanding question 


More specialized Wikibases can go way deeper while still linking back to 
Wikidata in a hub-and-spoke model 


Common Wikibase ontology ties these databases together 


Should | use Wikibase as my data 
backend? 
Maybe. 


Reasons not to use Wikibase 


1. You have a small amount of data that is in 
Wikidata's scope 


e Wikidata has a broad scope and generally loose notability requirements 
e You need to create fewer than 100,000 new entries 
e New properties can always be requested 


e Or, the community will help you find a way to use existing properties to 
express the data 


e Just put it in Wikidata and let the community translate and augment your 
data 


2. You have a simple relational (table-like) 
data model 
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e Wikibase is a graph database 


Tuple { 


e Graph database let you express complex 
multi-directional relationships with several 
thousand attributes 


Relation 


e ...at significant cost to performance and Y 
ease-of-use Ss 
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e You are better served by a MediaWiki 
extension like Cargo 


e Or even a proprietary service like Airtable 


3. Your data is already cleanly structured, with 
no need for human edits 


Wikibase helps you turn unstructured data (like Wikipedia) into structured 
data 


If you have no need for this, you will find Wikibase to be a tedious 
intermediary, since you can only import as fast as a bot can edit 


e Unless you are directly manipulating the backend database 


Cut out the middle man and import your data directly into a database like 
Virtuoso or Neo4| 


e If necessary, use a transform process to model the data as Wikibase RDF. 


4. To visualize data 


e Wikibase Is not a data visualization tool 
e Itjustisn't 


e You will need other tools to make the data “friendlier” to consume 


Best uses for Wikibase 


e Adding structure to complex, unstructured data 
e With many different attributes, or attributes that don't apply to every entity 
e With complex multidirectional relationships between entities 


e And doing so with human collaborators 


Hybrid approaches 


Blending Wikidata with other solutions 

e Use aconventional database, then link back to Wikidata IDs with a database 
column 
e Then selectively incorporate data or contribute it back 


e Create a Wikibase where items link back to a more structured dataset via 
standard identifier 


e And then accept human contributions on top via Wikibase's editing interface 
e Query both datasets from a common query service 


e Common standards and the use of persistent identifiers enables this 


Learning more 


e https://wikiba.se 

e https://wikidata.org 

e james@scatterred 

e https://calendly.com/hare| 


